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Abstract 

A search for a neutral Higgs boson decaying to a pair of b quarks, and produced in 
association with at least one additional b quark, is presented. Multijet final states with 
three jets identified as originating from b quarks, at least one of which may include a 
non-isolated muon, are studied. The data used in this analysis correspond to an inte- 
grated luminosity of 2.7-4.8 fb _1 , collected by the CMS experiment in proton-proton 
collisions at the LHC at a center-of-mass energy of 7 TeV. This search is particularly 
sensitive to Higgs bosons in MSSM scenarios with large values of tan /3. No excess 
over the predicted background from standard model processes is observed. Stringent 
upper limits on cross section times branching fraction are derived and interpreted as 
bounds in the MSSM tan ft and Ma parameter-space. Observed 95% confidence level 
upper limits reach as low as tan jS m 18 for Ma ~ 100 GeV. 
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1 Introduction 

The electroweak symmetry breaking mechanism of the standard model (SM) predicts the ex- 
istence of a neutral scalar boson, the Higgs particle. A boson has been recently discovered, 
with a mass around 125 GeV (H |2 and properties consistent with those expected for the SM 
Higgs boson. However, its exact properties and the detailed structure of the Higgs sector still 
need further investigation. Moreover, the mass of the Higgs boson is quadratically divergent 
at high energies J3j. Supersymmetry [4] is a well known extension to the SM which allows the 
cancellation of this divergence. 

In contrast to the SM, the Minimal Supersymmetric Standard Model (MSSM) [5] features two 
scalar Higgs doublets, giving rise to three neutral Higgs bosons, collectively denoted as (p, and 
two charged ones, H ± . Two of the neutral bosons are CP-even (h, H) and one is CP-odd (A). In 
this context, the recently discovered boson with a mass near 125 GeV might be interpreted as 
one of the neutral CP-even states. At tree level, two parameters, conventionally chosen as the 
mass of the pseudoscalar Higgs boson Ma and the ratio of the vacuum expectation values of 
the two Higgs doublets, tan jS = Vi/v\, define the Higgs sector in the MSSM. For tan f> larger 
than unity, the Higgs field couplings to up-type particles are suppressed relative to the SM, 
while the couplings to down-type particles are enhanced by a factor of tan ft. In addition, the 
mass Ma is expected to be nearly degenerate with either Mh or Mh- Therefore, the combined 
cross section of Higgs boson production in association with b quarks is effectively enhanced 
by a factor ~2 tan 2 /3. Moreover, the decay into b quarks has a very high branching fraction 
(?»90%), even at large values of the Higgs boson mass Ma- The sensitivity for a SM Higgs 
boson search for the corresponding channel is negligible given the small cross section. 

Recent results at the Large Hadron Collider (LHC) on the (p — ^ XX decay mode [6, 7J provide 
stringent constraints on tan f5, complementing previous results from the LEP experiments |8l 
and superseding those from the Tevatron experiments p HTT) . Similar searches in the <p — > bb 
decay mode have also been performed by the CDF and DO experiments ffl"2| at the Tevatron 
collider. An excess of events of ~ 2 standard deviations with respect to the expectations from 
SM background have been reported by both experiments for a resonance in the mass range 
100-150 GeV. 

In this Letter we present a search for MSSM neutral Higgs bosons produced in association with 
at least one b quark, and decaying into a pair of b quarks. Prospects for this channel at the LHC 
have been studied in Refs. |Tl3l IT4l . This analysis is performed using 2.7-4.8 fb -1 of proton- 
proton collisions with a center-of-mass energy of 7TeV collected in 2011 by the Compact Muon 
Solenoid (CMS) detector at the LHC. The dominant background is the production of heavy- 
flavor multijet events containing either three b jets, or two b jets plus a third jet originating 
from either a charm or a light-flavor parton, which is misidentified as a b jet. 

A signal is searched for in final states characterized either purely by jets ("all-hadronic") or 
with an additional non-isolated muon ("semileptonic"). Events are selected by specialized 
triggers that include online algorithms for the identification of b jets to tackle the large multijet 
production rate at the LHC. The common analysis strategy is to search, in events identified as 
having at least three b jets, for a peak in the invariant mass distribution of the two leading b 
jets, i.e. those having the largest transverse momentum, over the large multijet background. 
A key point of both analyses is the estimation of the background using control data samples, 
which is addressed with different methods. The two analyses reach similar sensitivity to the 
MSSM Higgs scenarios described. The corresponding data sets are largely exclusive, and the 
small overlap is removed for the combined results. 
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3 Event reconstruction and simulation 



2 The CMS experiment 

The central feature of the CMS detector is a superconducting solenoid of 6 m internal diame- 
ter, providing a magnetic field of 3.8 T. Within the field volume, the inner tracker is formed 
by a silicon pixel and strip tracker. It measures charged particles within the pseudorapidity 
range \rj\ < 2.5. The pseudorapidity is defined as t] = — ln(tan(0/2)) and 9 is the polar angle, 
while (p is the azimuthal angle in radians. The tracker provides an impact parameter resolu- 
tion of approximately 15 ]im and a resolution on transverse momentum (pj) of about 1.5% for 
100 GeV particles. Also inside the field volume are a crystal electromagnetic calorimeter and a 
brass/ scintillator hadron calorimeter. Muons are measured in gas-ionization detectors embed- 
ded in the iron flux return yoke. Muons are measured in the pseudorapidity range < 2.4, 
with detector planes made using three technologies: drift tubes, cathode-strip chambers, and 
resistive-plate chambers. Matching muons to tracks measured in the silicon tracker results in 
a transverse momentum resolution between 1% and 5%, for pj values up to 1 TeV. Extensive 
forward calorimetry complements the coverage provided by the barrel and endcap detectors. 
A more detailed description of the CMS detector can be found in Ref. |15X 

3 Event reconstruction and simulation 

The CMS particle-flow event reconstruction l!T6l [17) is used for optimized reconstruction and 
identification of all particles in the event, i.e. electrons, muons, photons, charged hadrons, and 
neutral hadrons, with an extensive combination of all CMS detectors systems. 

The reconstructed primary vertex with the largest p T -sum of its associated tracks is selected 
and used as reference for the other physics objects. 

Jets are reconstructed using the anti-fcj algorithm l|T8l| from particle-flow objects with a radius 
parameter R = 0.5 in the rapidity-azimuthal angle space. Each jet is required to have more 
than one track associated to it, and to have electromagnetic and hadronic energy fractions of at 
least 1% of the total jet energy. Additional proton-proton interactions within the same bunch 
crossing (pileup) affect the jet momentum reconstruction. To mitigate this effect, a track-based 
algorithm that removes all charged hadrons not originating from the primary interaction is 
used. In addition, a calorimeter-based algorithm evaluates the energy density in the calorime- 
ter from interactions not related to the primary vertex, and subtracts it from the reconstructed 
jets in the event. Additional jet energy corrections l|T9t are applied. 

Muons are reconstructed using both the inner silicon tracker and the outer muon system [20J, 
and by performing a global track fit seeded by signals in the muon system. 

The combined secondary vertex (CSV) algorithm |2"Tf is used in the offline identification of b 
jets. The CSV algorithm uses information on track impact parameter and secondary vertices in 
a jet combined in a likelihood discriminant that provides a good separation between b jets and 
jets of other flavors. Secondary-vertex reconstruction is performed with an inclusive vertex 
search amongst the tracks associated to a jet |22 | . 

Simulated samples of signal and background events were produced using various event gener- 
ators and including pileup events. The CMS detector response is modeled with GEANT4 [23J. 
The MSSM Higgs signal samples, pp — > bb(p+X, (p — > bb, were produced with PYTHIA v6.424 [24], 
which yields the pj and tj distributions of the leading associated b jet in good agreement with 
the NLO calculations [25]. The multijet background events were produced with PYTHIA and 
alpgen [26], while for tt + jets events the MadGraph [27J event generator was used. For all 
generators, the CTEQ6L1 [28 1 parton density function (PDF) has been used. 
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4 All-hadronic signature 

We search for the Higgs boson in events where the three leading (highest-px) jets are all b- 
tagged. A signal would be identified as a peak in the invariant mass distribution of the two 
leading jets. Events in the data with only two b tags among the three leading jets are used as to 



model the background, after proper reweighting, as described in Section 4.2 
4.1 Trigger and event selection 

The large hadronic interaction rate at the LHC poses a major challenge for triggering. Events 
are accepted if either two or three jets are produced in the pseudorapidity range \rj\ < 2.6 and 
have px above certain thresholds. Data taking was divided into three categories according to 
the trigger selections. The first (second) category is characterized by dijet triggers in which the 
leading jet is required to have px > 46 (60) GeV, and the next- to-leading jet pj > 38 (53) GeV. 
The third category is similar to the first but requires a third jet with px > 20 GeV. These re- 
quirements were needed to keep the trigger rates manageable as the instantaneous luminosity 
of the LHC increased during the 2011 data taking period. The online identification of b jets 
is performed by an algorithm based on the impact parameter significance of the second most 
significant track associated to the jet as the b-tagging discriminant. Only events with at least 
two jets passing the online b-tagging requirement are accepted by the trigger. 

While the jet px thresholds in the trigger had to be raised with increasing luminosity the trig- 
gers with lower thresholds allow for better exploration of the low-mass region with more re- 
laxed jet px requirements also at the analysis level, albeit with smaller integrated luminosity. 
The inclusion of the higher-threshold triggers allows higher integrated luminosity, but with the 
adjusted analysis requirements only the medium to high mass region can be covered. For this 
reason two analysis scenarios are defined: in the low-mass scenario (Md> < 180 GeV), events 
accepted by the low jet px threshold triggers (first and third categories) are selected correspond- 
ing to an integrated luminosity of 2.7 fb -1 . In the medium-mass scenario > 180 GeV), a 
combination of dijet triggers with low and high jet px thresholds (first and second categories) 
forms an event sample with an integrated luminosity corresponding to 4.0 fb _1 . 

Events are required to have at least three reconstructed jets with \t]\ < 2.2, motivated by the 
balance between b-tag efficiency and mis-tag probability. The three leading jets must also 
pass the px cuts of 46, 38 and 20 GeV (60, 53 and 20 GeV), respectively, in the low- (medium-) 
mass scenario. A minimal separation of AR > 1 between the two leading jets, where AR = 
\J [At]) 2 + (A<p) 2 and At] and Acp are the pseudorapidity and azimuthal angle differences be- 
tween the two jets, is required to suppress background from gluon splitting to a b-quark pair. 

We define a "triple-b-tag" sample to search for a signal by requiring all three leading jets to pass 
a tight CSV b-tagging selection requirement, consistent with the online b-tagging demand, at 
a working point characterized by a misidentification probability for light-flavor jets of about 
0.1% at an average jet px of 80 GeV. The total numbers of events passing the trigger and offline 
selections are 106 626 and 89 637 for the low- and medium-mass scenarios, respectively. The 
efficiency of the trigger for signal events passing the offline selection is 47-67%, for a Higgs 
boson mass in the range of 90-350 GeV. 

We define a "double-b-tag" sample, which is instrumental in estimating the shape of the back- 
ground, where only two of the three leading jets have to pass the above-mentioned criteria, 
while the remaining untagged jet does not have to fulfill any b-tagging requirements. 

The secondary-vertex mass, namely the invariant mass calculated from all tracks forming the 
secondary vertex, provides an additional separation between b-, c- and light-flavor jets (at- 
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4 All-hadronic signature 



tributed to u, d, s, or g partons) beyond the CSV b-tagging selection requirement. A compact 
b-tagging variable for the whole event is constructed by assigning to each selected jet j, where 
] is the rank of the jet in the order of decreasing pj, an index Bj which can take one of three 
possible values. For jets with no reconstructed secondary vertex, or where the secondary ver- 
tex mass is below 1 GeV, Bj is set to zero. For intermediate values of the vertex mass between 
1-2 GeV the index is set to 1, and for vertex mass larger than 2 GeV it is set to 2. The three 
indices B\, Bi, and B3 are combined in an event b-tag variable X123, which is defined as follows: 
X123 = X12 + X3, where X12 = 0, 1 or 2 depending on whether B\ + Bi < 2, 2 < Bi + B2 < 3 or 
#1 + B2 > 3, respectively and X3 = if B3 < 2, and X3 = 3 otherwise. 

By construction, the event b-tag variable X123 can have six possible values ranging from to 5. 
The intention of this mapping is to have each bin populated with sufficient statistics. For event 
types with a strong triple b-tag signature, the X123 distribution typically shows peaks at values 
of 2 and 5. 

4.2 Background model and signal extraction 

The dominant background comes from Quantum Chromodynamics (QCD) multijet production 
with two or more jets containing b hadrons, and can neither be fully reduced by kinematic 
selection, nor reliably predicted by Monte Carlo (MC) simulation. For this reason, a method 
based on control data samples, similar to the one used in Ref. ||29| is applied. The background 
model is constructed from templates that are derived from the double-b-tag sample. 

We divide the events in the double-b-tag sample into the following categories: bbx, bxb, and 
xbb, depending on the rank, sorted by pj, of the untagged jet, which is represented by the 
lower-case letter x. The ranking in descending pj of the three jets is incorporated in the nomen- 
clature adopted here, e.g. bbx means a sample of events where the two leading jets are b tagged 
and the third jet is the untagged jet. The true flavor of the untagged jet can be either light (u,d,s 
flavor quark, or g, denoted collectively by q), charm (c) or bottom (b). 

From these three double-b-tag categories, nine background templates are constructed by weight- 
ing each untagged jet with the b-tagging probability assuming that its true flavor corresponds 
to either a light parton (u, d, s, or g, denoted by Q), a charm (C) or a bottom (B) quark. The con- 
vention is that the capital letter indicates the assumed flavor of the untagged jet. The b-tagging 
probability for each flavor is determined as a function of jet pj and rj with simulated multijet 
events. Data/MC scale factors for the b-tagging efficiencies of b, c, and light flavor jets are 
applied where appropriate l(2"Tf . 

Each background template is a distribution in the two-dimensional space spanned by M12, the 
dijet mass of the two leading jets, and the event b-tag variable X123. 

The following nine background templates are thus created: Qbb; Cbb; Bbb; bQb; bCb; bBb; 
bbQ; bbC; and bbB. In the bbb background events, two bb pairs are present. As pointed out 
in Ref. 1129 1 , the template bbB models mainly bbb events in which the two leading b quarks 
originate from the same bb pair in the event, while Bbb and bBb are important to cover cases 
where the two leading b quarks originate from different bb pairs. 

The X123 dimension of the templates is modeled in a similar way. Each of the three possible 
values of the secondary-vertex mass index of the untagged jet is taken into account with a 
weight according to the probability that a jet will end up in a given bin of the secondary-vertex 
mass distribution. These probabilities, parametrized as a function of the jet pj and rj, have been 
determined for each flavor using jets from simulated tt events. 
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Some of the nine templates are similar to each other in shape both for M12 and X123. In the 
cases where one of the two leading jets is untagged, e.g. Qbb, and bQb, the templates are 
combined, resulting in a merged template (Qb)b = Qbb + bQb. By analogy, also (Cb)b and 
(Bb)b are obtained. When the third-leading jet is the untagged one and the assumptions of its 
flavor are either Q or C, the bbQ and the bbC templates are combined to form the template 
bbX. The total number of templates to be fitted to the data is therefore reduced from nine to 
five, namely (Bb)b, (Cb)b, (Qb)b, bbB and bbX. The projections of the M12 and X123 variables 
are shown in Fig.[l]for the five background templates and for the low-mass scenario. 

CMS 2011, L = 2.7 fb" 1 ,\fs = 7 TeV CMS 2011, L = 2.7 fb'\\fs = 7 TeV 

' • i i i i | i , i i | i ,_ 

All-Hadronic Analysis 
Low-Mass Scenario _1 

(Bb)b ; 

— (Cb)b ~ 
(Qb)b : 

— bbB ~E 
bbX : 
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Figure 1: The Myi (left) and X123 (right) projections of the five background templates, (Bb)b, 
(Cb)b, (Qb)b, bbB, and bbX, for the low-mass scenario. 

Templates whose dijet mass spectra resemble each other can be clearly distinguished with the 
introduction of the event b-tag variable X123. This is the case for example between (Bb)b and 
(Cb)b. In general, the event b-tag significantly improves the discrimination among all flavor 
components modeled. 

The background templates whose projections are shown in Fig. [T] include two additional cor- 
rections. The basic assumption of the background model, that the double-b-tag sample (bb) 
consists entirely of events with at least two genuine b jets, is only approximately correct. Al- 
though the remaining contamination from non-bb events is indeed very small, the impact of 
the b-tagging selection could lead to distortions of the background model and a correction must 
be applied. This contamination is estimated directly from the data using a negative b-tagging 
discriminator [30] constructed with a track-counting algorithm based on the negative impact 
parameter of the tracks, ordered from the most negative impact parameter significance upward. 
The set of events in the double-b-tag sample in which at least one of the b-tagged jets passes a 
certain threshold of the negative b-tagging discriminator is used as a model for the contamina- 
tion by non-bb events. The threshold is calibrated as a function of jet pj with simulated multijet 
events, such that the negative tag rate equals the mistag rate. With this method, the non-bb con- 
tribution is found to be at the level of 3-4%. This correction results in only a marginal change in 
template shape. A second correction is necessary because the online b-tagging patterns differ 
in the double- and the triple-b-tag samples. The correction is determined from simulation, and 
is applied by appropriate weighting of the events in the double-b-tag sample. 

A signal template is obtained for each considered value of the Higgs boson mass by perform- 
ing the full selection on the events of the corresponding simulated signal sample. The mass 
resolution for combinations where both b jets stem from the Higgs decay ranges from 12-14% 
over the mass range of 90-350 GeV. In addition, combinations with at least one of the b-tagged 
jets originating from other sources contribute to the signal mass spectrum. The fraction of true 
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5 Semileptonic signature 



combinations within lc of the mass resolution increases from 50-90%. Similar figures apply 
also for the semileptonic signature discussed in Section [5] 

The signal is extracted by fitting a linear combination of signal and background templates, 
N bbb (/sig^sig + Li /bgd T bgd)' to the observed histogram in M 12 and X 123 space, where N bbb is the 
total number of selected triple-b-tag events, T^ d and T S i g are the above-mentioned background 

and signal templates, each normalized to unity, and and / S i g are the background and signal 
fractions determined by the fit. The results of the fit are discussed in Section [7] 

5 Semileptonic signature 

In the semileptonic signature, as for the all-hadronic one, a signal is searched for in events with 
three identified b jets, as a peak in the invariant mass distribution of the two leading jets. The 
expected background distribution and normalization is built using the same distribution for 
events with three jets of which only one or two are tagged as b jets, reweighting the events 
with a probability derived from a control region and computed with two different techniques. 
The muon requirement in the final state reduces the absolute signal efficiency, since it selects 
events where at least one of the b quarks decayed semileptonically in the muon channel, but it 
helps to reduce the event rate at the trigger level, allowing for a lower threshold for the jets. 

5.1 Trigger and event selection 

The data used in the semileptonic analysis were collected using different trigger selections, to 
cope with the increasing luminosity. All the triggers required a muon with a pj > 12 GeV 
threshold and the presence of one or two central jets (\t]\ < 2.6 ) with transverse momentum 
above a given threshold (20 or 30 GeV, depending on the data- taking period). Furthermore, one 
or two b-tagged jets are required online. Initially the track with the second-most significant im- 
pact parameter was used. Later, when a second online b-tag was introduced, the selection was 
on the first track, in order to retain enough signal efficiency even with this tighter selection. An 
integrated luminosity of 4.8 fb _1 has been analyzed, and about 1.67 x 10 7 events were collected. 

The offline analysis requires a muon with pj > 15 GeV, at least three jets with \rj \ < 2.6, having 
transverse momentum pj > 30 GeV for the first two and pj > 20 GeV for the third one. The 
separation between any pair of jets has to be AR > 1. The two leading jets must be b-tagged 
using the CSV b-tagging algorithm with a working point giving mistag probability for light jets 
of about 0.3%. The muon must be contained in one of the two leading jets. The final selection 
for the signal search adds the requirement that the third jet is also b-tagged, with a looser CSV 
b-tagging selection requirement, corresponding to a mistag probability of about 1%. The total 
number of events which pass the selection is 60 195. 

The relative efficiencies of the triggers with respect to the offline selection criteria were mea- 
sured using lower-threshold single-muon triggers. These efficiencies are found to be about 
45-60%, depending on the Higgs boson mass and the trigger. 

5.2 Background determination and signal extraction 

As in the all-hadronic final state, the major backgrounds for the semileptonic final state are 
multijet events from hard-scattering processes. Other background processes, such as tt + jets 
and Z — > bb + jets, are predicted by the MC simulation to be less than 1% of the total back- 
ground. Other possible backgrounds from events with multiple vector bosons (ZZ, ZW, WW) 
are negligible. 
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Two methods, both derived from data, have been developed to predict the expected back- 
ground. The first is based on the computation of b-tagging probabilities of the third jet; and 
the second is based on a nearest-neighbor-in-parameter-space technique. They are able to pre- 
dict the yield and shape of the multijet background as well as other minor contributions. The 
two methods use completely exclusive data samples, so their two predictions are independent. 
The first method uses double-b-tag samples (bbj) and the second uses single-b-tag samples (bjj) 
with the double-b-tag events removed. 

Both methods require a background-rich sample to serve as a control region. We construct a 
discriminating variable with a likelihood ratio, using various kinematic inputs: the pj of the b 
jets; separation in cp and f] of the b jets; separation in (p and rj between the third jet and the com- 
bination of the two leading jets; and the b jet multiplicity. Two versions of this discriminating 
variable are used: one for the low-mass region (M<p < 180 GeV) and another for the medium- 
mass region (Md> > 180 GeV). For both mass ranges, the control region is defined as the sample 
of events having a low value for the discriminating variable, where the background is enriched 
and the signal depleted, and the signal region is defined as the complementary sample. The 
number of events in the signal region is 33 366 and 16 866, for the low- and medium-mass re- 
gions, respectively. 

The first method, henceforth called the matrix method, uses the probability that a jet is iden- 
tified as a b jet to predict the background in the signal region. The predicted distribution of 
any observable x in the three-b-jet sample can be calculated by rescaling, on an event-by-event 
basis, the same distribution for the bbj sample by the probability of the third jet to be b- 
tagged. Taking into account the contribution of b, c, and light jets, the probability expands as: 
= £ b • fb + £ c ■ fc + e q ■ (1 — /b — fc), where £b, c, q are the probabilities (or b-tagging efficien- 
cies) for a jet to be b-tagged, if it is originated from b, c, or light parton, respectively. The /b, c 
are the probabilities that the third jet originates from the corresponding quark, which depend 
on the kinematics of the event and of the third b jet. 

The b-tagging efficiencies are taken from the MC simulation and checked with several methods 
derived from data H3"T| . Data/MC scale factors (close to unity within a few percent) are applied 
for the efficiencies and the corresponding uncertainties used as systematics uncertainties. The 
efficiencies are parametrized as functions of the third jet pj, tj and charged-particle multiplic- 
ities. The quark flavor fractions are obtained directly from data by a simultaneous fit of two 
flavor-sensitive observables, using templates built from simulated events with b, c and light 
quarks. The first variable used is a b-tag discriminator which uses the confidence level that the 
four tracks with the highest impact parameter in the jet are consistent with originating from 
the primary vertex. The second is the invariant mass associated with the secondary vertex, if it 
has been reconstructed. The parametrization for quark fractions also depends on the angular 
separation between the three b jets. Only events in the control region are used to obtain the 
quark fraction, which is then used to predict the background in both control and signal region. 

The second method, called the nearest-neighbor method, exploits the fact that the probability 
for an event to appear signal-like depends on several event and jet variables. Events from the 
background enhanced control region are categorized according to several such variables, and 
are used to create a multi-parameter background prediction. The method uses the bjj sample, 
and determines, for each event, the probability to pass the final selection. Starting from the bjj 
sample, excluding the bbj events, we can identify four disjoint subsets with which we work: 
(1) bjj (including bjb) in the control region, (2) bbb in the control region, (3) bjj in the signal 
region, and (4) bbb in the signal region. The sum of the above sets corresponds to the initial 
sample, where the leading jet is always b-tagged. We call collectively "training sample" the 
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sum of subsets (1) and (2), and "testing sample" the sum of (3) and (4). The probability that an 
event in the testing sample passes the full selection is estimated by considering a larger sample 
of "similar" events in the training sample, and counting how many of these events pass the full 
selection. 

For each event in the testing sample, referred to as test events, we select a sample of events 
with similar kinematics in the training sample. The probability for a test event to pass the 
final selection is calculated by selecting a sample of 100 training events inside a hyper-ellipsoid 
in the multi-dimensional space of event and jet observables, centered at the test event. The 
training events are chosen as those having the smallest multi-dimensional distance D, where 
D 2 = YliL\ w 2 (xj test — x ; tramm §) 2 , Xj is the variable defining the test and training event, and 
ny is the number of variables. The weights W\ account for the different dispersions of the 
variables and their different sensitivity to the b-tagging probability. A total of ny = 14 different 
variables are used, including pj, tj and the charged-particle multiplicity of the jets, the angular 
separation between the jets, and the invariant mass and transverse momentum of the combined 
jet-jet system. The weights vo\ are computed from the derivative of the probability for an event 
to pass the final selection as a function of the variable %{. We then compute the numbers of 
bbb and bjj events inside this training sample. Finally, the probability for test events to have 
three b-tagged jets is computed as the ratio of bbb to bjj events in the training sample, using 
a weighted average, with 1 / D 2 as weight. The probability obtained this way is then applied, 
event-by-event, to the sample of test events to predict the invariant mass distribution of the 
sample passing the final selection. This method gives a prediction of the background shape in 
the signal region independent from that obtained with the matrix method. 




M 12 [GeV] M 12 [GeV] 



Figure 2: Invariant mass of the two leading jets for data in the control regions, for low- (left) 
and medium-mass (right) regions. Predictions with matrix (black histogram), with nearest- 
neighbor methods (green histogram), and data (red dots) are overlaid. The predictions are 
normalized to the data. 

The background predictions for the invariant mass distribution of the two leading jets from the 
two methods described above are shown in Fig. |2j They are compared with the actual distribu- 
tion in events with three b-tagged jets in the control region (low value of the discriminator), for 
low- and medium-mass regions. The predictions are normalized to the number of events seen; 
the absolute normalization of the prediction will be discussed in Section |6j 

Because the matrix and nearest-neighbor methods use exclusive data samples, we can combine 
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Table 1: Systematic uncertainties on the signal yield from the various sources listed in the first 
column. The following two columns list the resulting uncertainties in the all-hadronic and 
semileptonic analyses. The upper group is for the signal, the lower for the model-dependent 
limits. A range indicates the variation across the probed Higgs boson mass values. The source 
with + also affects the background, while those with * only affect the model-dependent results 
in the space of the MSSM parameters Ma and tan /3. The sources labeled with "rate" affect only 
the total signal yield, those with "shape" also the shape of the signal. 



Source 


All-hadronic Semileptonic 


Type 


Trigger efficiency 


10% 


3 - 5% 


rate 


Online b-tagging efficiency 


32% 




rate 


Offline b-tagging efficiency 


10-13% + 


12% 


shape /rate 


b-tagging efficiency dependence on topology 


6% 




rate 


Jet energy scale 


1.4-6.8% 


3.1% 


shape /rate 


Jet energy resolution 


0.6-1.3% 


1.9% 


shape /rate 


Muon momentum scale and resolution 




1% 


rate 


Signal Monte Carlo statistics 


1.1-2.6% 


rate 


Integrated luminosity 




2.2% 


rate 


PDF and a s uncertainties 


3-6%* 


2.7^.7%* 


rate 


Factorization and renormalization QCD scale 


6 


-28%* 


rate 


Underlying event and parton showering 




4%* 


rate 



their results. This is done by performing a weighted average of their bin-by-bin predictions, us- 
ing the statistical uncertainties as weights (w = 1/uf). In case the x 2 of the average is greater 
than 1, (\fx^ — 1) • 0j is used, bin-by-bin, as an additional systematic uncertainty, following the 
Particle Data Group prescription [32|. 

6 Systematic uncertainties 

Various systematic uncertainties on the expected signal and background estimates affect the 
cross section estimation and, consequently, its interpretation within the MSSM. In both analy- 
ses the main source of systematic uncertainty on the estimated signal yield comes from uncer- 
tainties related to jet reconstruction and b tagging. The second source is the turn-on behavior of 
the trigger efficiency, given the rather low thresholds used in the event selection. Other sources 
include uncertainties on the integrated luminosity and lepton identification. The theoretical 
cross sections used for the MSSM interpretation are subject to factorization and renormaliza- 
tion scale uncertainties, uncertainties due to the choice of parton distribution functions and a s , 
and uncertainties from the underlying event and parton shower modelling 1133 1 . These uncer- 
tainties affect only the computation of the upper limits for the MSSM parameter tan f> from the 
cross section results. The systematic effects directly affecting the signal efficiency, hence the 
cross section and MSSM interpretation, are summarized in Table [l] 

There are systematic uncertainties that affect only the all-hadronic or semileptonic analyses. In 
the all-hadronic analysis, Table [I] includes systematic uncertainties related to the efficiency of 
the online b-tag selection relative to that applied offline, and to a slight dependence of the b- 
tagging efficiency on the jet topology. Various uncertainties also affect the shapes of the signal 
and background templates used in the fit. Shape-altering effects from uncertainties on the jet 
energy scale, jet energy resolution, b-tagging efficiency and mistag rates are accounted for in the 
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7 Results 



fits with nuisance parameters. For the background templates, only the latter two are relevant. 
In the following we quantify background-related systematic uncertainties by their effect on 
the estimated signal fraction / S j g (defined in Section [4b . The uncertainty arising from the jet 
energy scale and the b-tagging efficiency on the template shape increases the / S i g uncertainty 
by typically 0.1-0.4%; the corresponding effect from the jet energy resolution uncertainty is 
0.1-0.3%. Additional shape-altering systematic uncertainties arise from the impurity of the 
double-b-tag sample and the online b-tagging correction to the background templates shape. 
The contribution of the former to the / S i g systematic uncertainty ranges between 0.1-0.3% in 
the mass range 90-130 GeV, and is below 0.1% elsewhere. The effect of the latter correction 
ranges from 0.1% to 0.4% in the mass range 90-160 GeV, and is below 0.1% elsewhere. The 
statistical uncertainty on the offline b-tagging efficiency values is propagated into the templates 
and accounted for in the fitting procedure. The impact on the / S i g uncertainty is typically in the 
range 0.1-0.6%. 

In the semileptonic analysis there are uncertainties on both the background shape and nor- 
malization. The shape-related uncertainty is inferred by the comparison of the background 



predictions obtained with the two methods described in Section 5.2 The corresponding un- 
certainty scaling is included on a bin-by-bin basis in the binned maximum likelihood fit to the 
distribution of the final observable. The background normalization uncertainty has two com- 
ponents: the first is related to the level of agreement between the predicted M\2 distribution 
and the actual bbb one in the data control region and the second is related to the extrapolation 
of this prediction from the control region to the signal region. The ratio between the predicted 
M12 distribution and the actual bbb one in the control region as seen in the data is used to 
normalize the prediction in the signal region, and its uncertainty is used as a systematic un- 
certainty. The scale factor is 0.877 ± 0.007 for the low-mass region and 0.885 ± 0.006 for the 
medium-mass region. For the extrapolation from the control region to the signal region, the 
MC simulation shows a constant ratio between the predicted M12 distribution and the actual 
bbb one in the signal region. The additional correction is 1.01 ± 0.04 and 1.02 ± 0.05 for the 
low- and medium-mass regions, respectively. The systematic uncertainties for the background 
normalization are 4.4% and 5.0% for the low- and medium-mass ranges, respectively. 



7 Results 

In the all-hadronic analysis, we first test the background-only hypothesis by performing a fit 
without including a signal te mp late, but only a linear combination of the background tem- 



plates, as described in Section 4.2 The coefficients are free parameters, but are constrained 



to be positive. Results are shown in Fig. [3p-|3f for the low- and medium-mass scenarios. The 
background model fits the data well within the uncertainty propagated from the templates 
(hatched area). According to the fit, the templates associated with production of three b jets 
provide the dominant contribution to the background. 

Subsequently, a signal template is included together with the background templates in the fit, 
with its fraction / s j g also allowed to vary freely. The fit is performed for Higgs boson masses 
from 90 to 350 GeV. The fit for a Higgs boson mass of 200 GeV in the medium-mass scenario is 
illustrated in Fig.[3ji. 



The semileptonic analysis uses a binned likelihood fit to the invariant mass distribution of 
the two leading jets in the event to extract a possible MSSM Higgs contribution. Two differ- 
ent background predictions are considered, for the low- and medium-mass regions, which are 
fitted separately. In the fit the shape and normalisation of the background component are con- 
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Figure 3: Results from the all-hadronic analysis. Top row: Result of the background-only fit 
in the triple-b-tag samples. The plot (a) shows the distribution of the dijet mass, M12, the plot 
(b) the distribution of the event b-tag variable X123 in the low-mass scenario. The hatched area 
at the edge of the summed background histogram corresponds to the uncertainty propagated 
from the templates. Bottom row: Dijet mass distribution in the medium-mass scenario, (d) with 
the background-only fit, and (d) including an additional signal template for a MSSM Higgs 
boson with a mass of 200 GeV. The fitted mass distribution of the Higgs contribution is shown 
a second time as the dashed histogram at the bottom of the figure. The fitted contribution of 
the (Qb)b template is compatible with zero within errors. 

strained through nuisance parameters as explained in section [6] The predicted background is 
shown in Fig.|4]for the two mass ranges, together with an expected signal for two Higgs boson 
masses at tan f> = 30. 



No significant deviation from background is observed in either analysis, and the CL S 
criterion is used to combine both results and determine the 95% confidence level (CL) limit on 
the signal contribution in the data, using the RooStats [38J package. To avoid correlations, 
in the all-hadronic analysis the events common to the semileptonic case are removed from the 
triple-b-tag samples. The fractions of events removed in the all-hadronic data samples are 2.3% 
and 2.7% for the low- and medium-mass scenarios, respectively. The requirement of a muon 
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Figure 4: Results from the semileptonic analysis. Data (red) and predicted background (blue) 
in the signal region, for (left) low-mass range (used for M<p < 180 GeV) and (right) medium- 
mass range (used for M^, > 180 GeV); the expected signal for different Ma and for tan /3 = 30 
in the m™ ax scenario, as described in the text, is also plotted. The difference between data 
and predicted background is also shown: the blue area represent the systematic and statistical 
uncertainties on the background prediction. 

in the semileptonic analysis and the harder kinematic selections of the all-hadronic analysis 
are responsible for such small overlap. Overlapping events in the simulated signal samples 
are also removed, although they are found to have negligible effect on the shape of the signal 
templates. 

Results are shown graphically in Fig. [5] in terms of cross section times branching fraction, and 
reported in Table |2j There is generally good agreement between the observed and expected 
upper limits within statistical errors, and no indication of a signal is seen. The observed upper 
limits range from about 312 pb at = 90 GeV to about 4 pb at M<p = 350 GeV. The individual 
observed limits of the two signatures are also displayed. The all-hadronic signature has a gen- 
erally larger signal efficiency, but requires higher thresholds for jet energies, while the presence 
of a muon in the semileptonic signature allows for lower thresholds at the cost of lower signal 
efficiency. As a result, both signatures are comparable in sensitivity. 

Figure [6] presents the results in the MSSM framework as a function of the MSSM parameters 
Ma and tan /3, combining the individual results of the two analyses, including all the statistical 
and systematic uncertainties as well as correlations. We use the MSSM m™ ax benchmark sce- 
nario [39. 40 j, which is designed to maximize the theoretical upper bound on Mh for a given 
tan /3 and fixed Msusy- Even though its parameters are under tension with the latest experi- 
mental results, it is currently still the most suitable benchmark scenario to compare the sensi- 
tivity of different analyses channels. The definition of theory parameters in the m™ ax bench- 
mark scenario is the following: M SUS y = 1 TeV; X t = 2M SU sy; H = 200 GeV; Mg = 800 GeV; 
M 2 = 200 GeV; and A b = A t ; M 3 = 800 GeV. Here, M SU sy denotes the common soft-SUSY- 
breaking squark mass of the third generation; X t = A t — }i/ tan /3 2 is the stop mixing parameter; 
A t and A^ are the stop and sbottom trilinear couplings, respectively; ]d is the Higgsino mass 



13 



parameter; Mg is the gluino mass; and M2 is the SU(2)-gaugino mass parameter. The value 
of Mi is fixed via the unification relation M\ = (5/3)M2sin$w/ cosflw- The expected cross 
section and branching fraction, in the MSSM framework, are calculated by BBH@NNLO |4l). in 
the 5-flavor scheme, and FEYNHlGGS |42l - |45l , respectively. Exclusion plots for two values of 
ji = ±200 GeV are shown. 

Figure [7] shows the results in the scenario with p = —200 GeV, together with previous limits 
set by Tevatron l|T2| in the multi-b jet final state, and by LEP [8|. In particular, no excess over 
the expected SM background is found for high values of tan j5 and for a resonance in the 100- 
150 GeV mass range, as previously reported by CDF and DO. The result of this work extends 
the sensitivity for MSSM searches in the (p — > bb decay mode to much lower values of tan f>, 
excluding the region where the excess was reported. 

The combined results reported in this Letter, using only the data collected at the LHC with a 
center-of-mass of \fs = 7 TeV, provides the most stringent limits on neutral Higgs boson decay 
in the bb mode, produced in association with b quarks. 




Figure 5: Observed and expected upper limits for the cross section times branching fraction 
at 95% CL, with linear (left) and logarithmic (right) scales, including statistical and systematic 
uncertainties for the combined all-hadronic and semileptonic results. One- and two-standard 
deviation ranges for the expected upper limit are also shown. 



8 Summary and conclusions 

We searched for a Higgs boson decaying into a pair of b quarks, produced in association with 
one or more additional b-quark jets. We used data samples corresponding to an integrated lu- 
minosity of 2.7-4.8 fb _1 collected in 2011 in proton-proton collisions at a center-of-mass energy 
of 7 TeV at the LHC. The data were collected with dedicated multijet triggers including b-tag 
selection, utilizing both all-hadronic and semileptonic event signatures. 

The search was performed on a triple-b-tag sample, using the invariant mass of two leading 
jets as a discriminating variable, with a prediction of the multijet background using control 
data samples. The all-hadronic analysis makes use of a second discriminating variable, X123, 
that reflects the heavy flavor content of the event. 
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8 Summary and conclusions 



Table 2: Expected and observed upper limits at 95% CL on 0"(pp — » b(p + X) x £>(</> — > 
bb), in pb, and on tan f> in the m™ ax benchmark scenario for two values of the parameter 
f/ = ±200GeV. 



M A (GeV) 


c(pp —?• b(p - 
expected 


f X) x B(cp bb) 
[pbl 

observed 


tan ft 
{]i = +200 GeV) 
expected observed 


tan ft 
{}i = -200 GeV) 
expected observed 


90 


486.3 


312.4 


28.2 




21.8 


23.4 


18.7 


100 


365.1 


163.2 


28.2 




17.7 


23.5 


15.7 


120 


172.1 


115.2 


25.7 




20.5 


22.0 


18.1 


130 


128.1 


104.5 


24.8 




21.9 


21.2 


19.1 


140 


92.0 


67.8 


25.1 




21.2 


21.3 


18.4 


160 


52.7 


38.3 


23.2 




19.5 


19.8 


17.0 


180 


34.4 


45.5 


23.5 




27.8 


19.9 


23.0 


200 


21.1 


19.8 


22.2 




21.6 


19.0 


18.5 


250 


13.5 


16.5 


29.1 




32.6 


23.7 


26.1 


300 


8.4 


10.9 


35.7 




42.2 


27.9 


31.8 


350 


5.8 


3.9 


44.0 




35.5 


33.0 


28.0 



No signal is observed above the SM background expectations, and 95% confidence level upper 
limits on the pp — > bcp + X, <p — > bb cross section times branching fraction are derived in the 90- 
350 GeV mass range. These results are interpreted, in the MSSM model and the m™ ax scenario, 
in terms of bounds in the space of the parameters, Ma and tan /3. The 95% confidence level 
bound on tan f> varies from about 18 to 42 in this Higgs boson mass range, thus excluding a 
region of parameter space previously unexplored for this final state. 
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Figure 6: Observed upper limits at 95% CL on tan j6 as a function of Ma, including the statistical 
and systematic uncertainties, in the m™ ax benchmark scenario, both for ]i = +200 GeV (left) and 
ji = —200 GeV (right), for the combined all-hadronic and semileptonic results. One- and two- 
standard deviation ranges for the expected upper limit are represented by the color bands. The 
expected upper limits for each of two signatures are also shown (dashed and dotted lines). 
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